PREFERENCE FOR CONDITIONED REINFORCEMENT
Similar resources
Conditioned Place Preference and Aversion
The use of a virtual reality environment (VRE) enables behavioral scientists to create different spatial contexts in which human participants behave freely, while still confined to the laboratory. In this article, VRE was used to study conditioned place preference (CPP) and aversion (CPA). In Experiment 1, half of the participants were asked to visit a house for two minutes with consonant music...
Conditioned reinforcement and response strength.
Stimuli associated with primary reinforcers appear themselves to acquire the capacity to strengthen behavior. This paper reviews research on the strengthening effects of conditioned reinforcers within the context of contemporary quantitative choice theories and behavioral momentum theory. Based partially on the finding that variations in parameters of conditioned reinforcement appear not to aff...
Triadimefon supports conditioned cue preference.
Triadimefon (TDF) is a fungicide with effects similar to cocaine, suggesting potential for abuse. Mice were trained in an apparatus with two distinctive flooring cues. In the experimental group, one of these was paired with administration of TDF and the other with vehicle; in the control group, both flooring types were paired with vehicle. The experimental group showed a significant preference ...
Preference-Based Policy Iteration: Leveraging Preference Learning for Reinforcement Learning
This paper makes a first step toward the integration of two subfields of machine learning, namely preference learning and reinforcement learning (RL). An important motivation for a “preference-based” approach to reinforcement learning is a possible extension of the type of feedback an agent may learn from. In particular, while conventional RL methods are essentially confined to deal with numeri...
Preference-based Reinforcement Learning
This paper investigates the problem of policy search based only on the expert’s preferences. Whereas reinforcement learning classically relies on a reward function, or exploits the expert’s demonstrations, preference-based policy learning (PPL) iteratively builds and optimizes a policy return estimate as follows: The learning agent demonstrates a few policies, is informed of the expert’s prefer...
Journal
Journal title: Journal of the Experimental Analysis of Behavior
Year: 1991
ISSN: 0022-5002
DOI: 10.1901/jeab.1991.55-37